CDS

Accession Number TCMCG075C09044
gbkey CDS
Protein Id XP_007037111.2
Location complement(join(18338150..18338617,18340196..18340579,18349319..18349543,18349629..18350021))
Gene LOC18604523
GeneID 18604523
Organism Theobroma cacao

Protein

Length 489aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007037049.2
Definition PREDICTED: uncharacterized protein LOC18604523 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category S
Description Nucleoporin protein Ndc1-Nup
KEGG_TC 1.I.1
KEGG_Module M00427        [VIEW IN KEGG]
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko03019        [VIEW IN KEGG]
KEGG_ko ko:K14315        [VIEW IN KEGG]
EC -
KEGG_Pathway ko03013        [VIEW IN KEGG]
map03013        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGCATTCAGCGGAGTCAGGCATGGTAGTGAAGAATCGGTTGGTAGAGTTCCTAATATGGCAGTCCATTCCAAGTAGTTTAATTTTCTTGGTTTTCACCGCAATGATCGTCTCTGGGCGCTCCCCCGCTGCGATCTTCATCTCCTTTTTGAGCTTTCATCTCTCCCAGCTGCTCTTCTCTGTCTCCCTCTCAGCCGTCTCATCTCCCGAGCGTAAATTTGGACGCAGTTTGCCGGTTATGCTGGGAGCCGCGGCCGTGTCGGGTTATGTCTCGGCAGTGTCCCTTTGTGGAGTGAATGGGAGAGTGGGGTTTAAGGGTTTTGCTTCCGGGTTGTTCTATGCCTTCTTTTATATATATAAGCGGAGATGGGTATTGCATTTTCCCATTATTCAGCGTTCCCCTTTCTTTAGCTTCAAGATGGGGATCCCTTCCGCTATCACACGAGCATTAAAGCTTTCTGCTGCAGCTTATCTATTTTCAGCTCTGCTGTTGGTATTTCTGCCACACCATTTTAACACTGAACTCGAGCTGGGAAACTTGTTTGCTCAACATGTAATCTCTTATTCTGTCAGCTTTTCTCTGTTTCTCTGTTGGGAATTAGCTCATCATTTACAACAGGTGCTACATACAAAAAGGTTCATATTCGCACCACCCAAAGGATCGGCAGCAGCAGAAACAAATCCAAGTGAGCCTCTCCTGGCTGCATTAGAGGAGAGTTCTCCAACTTCCCTTCTGAAATATCTTGCATACCTTGATCTATGTATGGTTTGTGAGAATAATGTTGACTATTGGCGTCGAGCTGCCTTCTTTGAAGAAACTGGTGAGACTTACAGAAGAGTTGCAGCTGTATGCTTGCGGCCTTTGGAGCAGCTTGCATCAAAATTGGGCGAAGGTTTGGAAGGTTCTTCAGATGGTAAAGCCTACCGAGTATCTGACCAGTTGCAGTCATCAACTGACCCACGAATGAATTCAAAATGCTATGAACTAATGAACAACTTCCAGCTTTACACATGGTCTGCTCGAACAATTGCATCCTTGACTGCACACTCACATAAAGAAGACAGATTTGGAGTTGCTCAACTCTCTGGTAGCAATGCTGCTGTTATCTCGACACTCATAGCTTGCCTGCTTGCTGTTGAAACATTCATGGGAAAAAAATCAAGTTTGCAACCATCTCCGCATTTGATGGGCCCAGCTGGCATTAAATGGGCTACATCGAGTATTGGAAGAAGAGATGTTAGAACAGGTAAAAGGAGAGACGGTCCACTTTATTCAAAAGCATATGCCATGGCTGATGTTTTAAGGACGTCAATTTACTGCATTGTGTCTGCTTTTCACAACGAGATGCTGACTAACGCCAAAGCTGGCCTTCTTGAGAAGGATTGGATTAGTAGTGGCAAACCTCCTTTTGGAACTCGCGAGCTGCTGTTGCAGAAATTGCTTCTTTTTCTTGACTTTCAAGCCAGCTAA
Protein:  
MHSAESGMVVKNRLVEFLIWQSIPSSLIFLVFTAMIVSGRSPAAIFISFLSFHLSQLLFSVSLSAVSSPERKFGRSLPVMLGAAAVSGYVSAVSLCGVNGRVGFKGFASGLFYAFFYIYKRRWVLHFPIIQRSPFFSFKMGIPSAITRALKLSAAAYLFSALLLVFLPHHFNTELELGNLFAQHVISYSVSFSLFLCWELAHHLQQVLHTKRFIFAPPKGSAAAETNPSEPLLAALEESSPTSLLKYLAYLDLCMVCENNVDYWRRAAFFEETGETYRRVAAVCLRPLEQLASKLGEGLEGSSDGKAYRVSDQLQSSTDPRMNSKCYELMNNFQLYTWSARTIASLTAHSHKEDRFGVAQLSGSNAAVISTLIACLLAVETFMGKKSSLQPSPHLMGPAGIKWATSSIGRRDVRTGKRRDGPLYSKAYAMADVLRTSIYCIVSAFHNEMLTNAKAGLLEKDWISSGKPPFGTRELLLQKLLLFLDFQAS